HDFS | 3.2.1 | Apache Hadoop Distributed File System |
YARN | 3.2.1 | Apache Hadoop NextGen MapReduce (YARN) |
MapReduce2 | 3.2.1 | Apache Hadoop NextGen MapReduce (YARN) |
Hive | 3.1.1 | Data warehouse system for ad-hoc queries & analysis of large datasets and table & storage management service |
HBase | 2.2.2 | Non-relational distributed database and centralized service for configuration management & synchronization |
ZooKeeper | 3.5.6 | Centralized service which provides highly reliable distributed coordination |
Ambari Metrics | 0.1.0 | A system for metrics collection that provides storage and retrieval capability for metrics collected from the cluster |
Ranger | 2.0.0 | Comprehensive security for Hadoop |
Flink | 1.13.5 | Apache Flink is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations over data streams. |
Kerberos | 1.10.3-30 | A computer network authentication protocol which works on the basis of 'tickets' to allow nodes communicating over a non-secure network to prove their identity to one another in a secure manner. |
OCEANBASE | 3.1.2 | An open source distributed relational database |
Clickhouse | 19.3.6 | Open source distributed column-oriented DBMS. |
Dlink | 0.6.0 | Dlink is an interactive FlinkSQL development platform built on Apache Flink. |
Dolphin Scheduler | 1.3.8 | A distributed, easily extensible visual DAG workflow task scheduling system |
Apache Doris | 0.11.0 | MPP-based analytical database for reporting and ad-hoc queries |
Elasticsearch | 7.2.0 | Indexing and Search |
Grafana | 5.2.4 | Dashboard |
Impala | 3.2.0 | An open source, analytic MPP database for Apache Hadoop that provides the fastest time-to-insight |
Kudu | 1.13.0 | A new addition to the open source Apache Hadoop ecosystem, Apache Kudu completes Hadoop's storage layer to enable fast analytics on fast data. |
Kyuubi | 1.5.0 | Apache Kyuubi, a distributed and multi-tenant gateway to provide serverless SQL on lakehouses |
Presto | 0.303 | Presto is an open source distributed SQL query engine for running interactive analytic queries against data sources of all sizes ranging from gigabytes to petabytes. |
Redis | 5.0 | Redis is an in-memory data structure store, used as database, cache and message broker |
Spark2 | 2.4.8 | Apache Spark is a fast and general engine for large-scale data processing. |
Sqoop | 1.4.7 | Tool for transferring bulk data between Apache Hadoop and structured data stores such as relational databases |
Tez | 0.9.2 | Tez is the next generation Hadoop Query Processing framework written on top of YARN |
ZEPPELIN | 0.8.0 | A web-based notebook that enables interactive data analytics. It enables you to make beautiful data-driven, interactive and collaborative documents with SQL, Scala and more. |